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Direct Cloning and Sequence Analysis of 
Enzymatically Amplified Genomic Sequences 



Stephen J. Scharf, Glenn T. Horn, Henry A. Erlich 

A method is described for directly cloning erizymaticaily amplified segments of 
genomic DNA into an M13 vector for sequence analysis. A 110— base pair fragment of 
the human p-globin gene and a 242-base pair fragment of the human leukocyte 
antigen DQa locus were amplified by the polymerase chain reaction method, a 
procedure based on repeated cycles of denaturation, primer annealing, and extension 
by DNA polymerase I. Oligonucleotide primers with restriction endonuclease sites 
added to their 5' ends were used to facilitate the cloning of the amplified DNA. The 
analysis of cloned products allowed the quantitative evaluation of the amplification 
method's specificity and fidelity. Given the low frequency of sequence errors observed, 
this approach promises to be a rapid method for obtaining reliable genomic sequences 
from nanogram amounts of DNA. 



UNDERSTANDING THE MOLECULAR 
basis of genetic disease or of com- 
plex genetic polymorphisms, such 
as those in the human leukocyte antigen 
(HLA) region, requires detailed nucleotide 
sequence information from a variety of indi- 
viduals to localize relevant variations. Cur- 
rendy, the analysis of each allelic variant 
requires a substantial effort in library con- 
struction, screening, mapping, subcloning, 
and sequencing. We report here a mediod 
for the enzymatic amplification of specific 
segments of genomic DNA and their direct 
cloning into M 1 3 vectors for sequence anal- 
ysis, using modifications of the polymerase 
chain reaction (PGR) (1, 2). This in vitro 
amplification procedure is based on repeated 
cycles of denaturation, oligonucleotide 
primer annealing, and primer extension by 
the Klenow fragment of DNA polymerase, 
and results in an exponential increase in 
copies of the region flanked by the primers. 
This method greatly reduces the number of 
cloned DNA fragments to be screened, cir- 
cumvents the need for full genomic libraries, 
and may allow cloning from nanogram 
quantities of genomic DNA. In addition, 
the cloning and sequencing of PCR- ampli- 
fied DNA is a powerful analytical tool for 
the study of the specificity and fidelity of 
this newly developed technique. 

To develop this technique for genomic 
sequence determination and to analyze the 
individual products of PCR amplification, 
we chose the oligonucleotide primers and 



probes previously described for the diagno- 
sis of sickle cell anemia (2). These primers 
amplify a 1 10-bp segment of the human p- 
hemoglobin gene containing the Hb-S mu- 
tation (Fig. 1). They were modified near 
their 5' ends (Fig. 1) to produce convenient 
restriction sites (linkers) for cloning directly 
into the M13mpl0 sequencing vector (3). 
These modifications did not affect the effi- 
ciency of PCR amplification of the specific 
P-globin segment (Fig. 2B), even though 
the overall pattern of PCR-ampIified prod- 
ucts is different (Fig. 2A). After amplifica- 
tion, the PCR products were cleaved with 
the appropriate restriction enzymes and dia- 
lyzed to remove inhibitors of ligation. These 
fragments were ligated into the M 13 vector, 
transformed into the JM103 host, and plat- 
ed out, and the resulting plaques were 
screened by hybridization with a labeled 
oligonucleotide probe to detect the p-globin 
clones. The plaques were also screened with 
the labeled PCR oligonucleotide primers to 
identify all of the clones containing ampli- 
fied DNA. Individual clones were then se- 
quenced directly by using the dideoxy prim- 
er-extension method (4). 

The incorporation of different restriction 
sites at the termini of the amplified product 
allows digestion of the vector with two 
restriction enzymes, thereby reducing the 
background of blue plaques from vectors 
without inserts to less than 11% (Table 1). 
More than 80% of the clones contained 
DNA inserts with the PCR primer se- 



quences but only about 1% of the clones 
hybridized to the internal p-globin probe. 
These nonglobin fragments presumably rep- 
resent amplifications of other segments of 
the genome. This observation is consistent 
with the gel and Southern blot analysis of 
the PCR-ampiified DNA from a p-globin 
deletion mutant and a normal cell line (Fig. 
2). The similarity of the observed gel pro- 
files reveals that most of the amplified geno- 
mic DNA fragments arise from nonglobin 
templates. Sequence analysis of two of these 
nontarget clones showed that the segments 
between the PCR primer sequences were 
unrelated to the p-globin gene and con- 
tained an abundance of dinucleotide repeats, 
similar to some genomic intergenic spacer 
sequences (5). 

When ten of the clones that hybridized to 
the p-globin probe were sequenced, nine 
proved to be identical to the p-globin gene 
and one contained five nucleotide differ- 
ences but was identical to the 5-globin gene. 
The p-globin PCR primers each have two 
mismatches with the 5-globin sequence 
(Fig. 1). The preferential PCR amplification 
of p-globin relative to 8-globin observed in 
our clonal analysis agrees with the results 
obtained with the oligomer restriction assay 
on the PCR reaction (2, 6). Each of these 
ten sequenced clones contains a segment of 
70 bp originally synthesized from the geno- 
mic DNA template during the PCR amplifi- 
cation process. Since no sequence alterations 
were seen in these clones, the frequency of 
nucleotide misincorporation during 20 cy- 
cles of PCR amplification is less than 1 in 
700. 

To analyze the molecular basis of genetic 
polymorphism and disease susceptibility in 
the HLA class II loci, this approach has been 
extended to the amplification and cloning of 
a 242- bp fragment from the second exon of 
the HLA DQa locus, which exhibits local- 
ized allelic variability. In this case the prim- 
ers, whose sequence is based on conserved 
regions of this exon, contain 5 '-terminal 
restriction sites with no homology to the 
DQa sequence (Fig. 1). The specificity of 
amplification achieved with these primers is 
greater than that achieved with the p-globin 
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A Human p -hemoglobin 

PC03: ACACAACTGTGTTCACTAGC — 
GH18: CTTCTGcagCAACTGTGTTCACTAGC — Met... 
TT AC ATTTG CTTCTG AC A CA ACTGTGTTC ACT AG C A ACCTC A AAC AG AC ACCATGGTG C AC . . . TGCCCTGTGGGGCAAGGTG AACGTGGATGAAGTTGGTGGTG AGGCCC . 
! " .AATGTAAACGAAGACTGTGTTGACACAAGTGATCGTTGGAGTTTGTCTGTGGTACCACGTG. . . ACGGGACACCCCGTTCCACTTGCACCTACTTCAACCACCACTCCGGG . 

CCACTTGCACCTACTTCgAaCAC :GH19 
CCACTTGCACCTACTTCAAC :PC04 



119 bp 



B 



Human HLA DQa 



GH26: gtgctgcaGGTGTAAACTTGTACCAG — 
. . .ACCACGTTGCCTCTTGTGGTGTAAACTTGTACCAGTTTTACGGTCCCTCTGGCCAGTACAC. . . TTAAACGCTACAACTCTACCGCTGCTACCAATGAGGTTCCTGAGGTC . , 
. . .TGGTGCAACGGAGAACACCACATTTGAACATGGTCAAAATGCCAGGGAGACCGGTCATGTG. . . AATTTGCGATGTTGAGATGGCGACGATGGTTACTCCAAGGACTCCAG . , 

GTTGAGATGGCGACGATGGcctaggCAc :GH27 



h 



242 bp 



Fig. I. Oligonucleotide primers for PCR amplification. <A) The nvo centraJ 
sequence lines show both strands of the human p-hcmoglobin gene (5) from 
52 bp upstream of the initiation codon (Met) to 84 bp dow nstream. Above 
and below this sequence are the oligonucleotide primers used to amplify this 
segment of the genome. The 3' end of each primer is marked by an arrow to 
indicate the direction of extension by DNA polymerase. Dors indicate 
nucleotide differences between p- and 6-globin. The primers PC03 and 
PC04 were used in our previous study (/, 2) and are completely homologous 
to the target gene. These sequences were extended and modified to produce 
the linker-primers GH18 and GH19. The modifications (lowercase letters) 
create a Pst I site in GH18 and a Hind III site in GH19. The amplified 



segment is 110 bp with PC03 and PC04, and 119 bp with GH18 and 
GHI9, (B) The central sequences show a segment of the human HLA DQa 
gene (10) from codons 9 to 92 (codons are numbered for the mature 
protein). This region codes for the highly polymorphic outer domain of the 
protein. As above, the oligonucleotide linker- primer sequences used to 
amplify this segment are shown, with lowercase letters designating nonho- 
mologous bases. In this case the desired restriction sites (Pst I for primer 
GH26 and Bam HI for GH27) were added to the 5' end of the primers 
w ithout regard to the target sequence. The length of the amplified product 
from HLA DQa is 242 bp. 



1 2 3 4 5 6 7 8 



1 2 3 4 5 6 7 8 



Table L Distribution of p-globin PCR clones. An 
entire PCR reaction was digested at 37°C with Pst 
I (20 U) and Hind III (20 U) for 90 minutes (for 
P-globin) or Bam HI (24 U) and Pst I (20 U) for 
60 minutes (tor HLA DQa). After phenol extras 
tion (11), the DNA was dialvzed to remove low 
molecular weight inhibitors of ligation (presum- 
ably the deoxynucleosidc triphosphates used in 
PCR), and concentrated by cthanol precipitation. 
All (p-globin) or one tenth (DQa) of the material 
was ligatcd to 0.5 u.g of the cut M 1 3 vector under 
standard conditions (11) and transformed into 
approximately 6 x 10 9 cells of freshly prepared, 
competent JM103 (3) in a total volume of 200 uX 
Fresh JM 103 culture (150 u.1) was then mixed 
with 10 to 30 u.1 of the transformed cells, plated 
on IPTG/X-gal agar plates (3), and incubated 
overnight. The plates were scored for blue (paren- 
tal) plaques and lifted onto BioDync A filters (3). 
These filters were hybridized (2) either with the 
labeled PC04 oligonucleotide to visualize all of 
the clones containing PCR-amplificd DMA 
(primer plaques) or with the RS24 oligonucleo- 
tide probe [the exact complement of the RS06 (6) 
probe] to specifically visualize the clones contain- 
ing hemoglobin sequences (globin plaques). Ten 
clones from the latter category were sequenced by 
the didcoxy extension method (4). Nine were 
identical to the expected p-hemoglobin target 
sequence and one was identical to the homolo- 
gous region of the human 8-hcmoglobin gene (5). 



Category 


Number 


Frequency 
(%) 


Total plaques 


1496 


100 


White plaques 


1338 


89 


Primer plaques ■ 


1206 


81 


Globin plaques 


15 


I 




242 bp - 



■119 bp- 




Globin 



DQa 



Fig. 2. PCR amplification with oligonucleotide linker-primers. DNA was amplified by mixing 1 u.g of 
genomic DNA in polymerase buffer (2) with 100 pmol of each primer. Samples were subjected to 20 
cvcles of PCR, each consisting of 2 minutes of denaturation at 95°C, 2 minutes of cooling to 37°C, and 
2 minutes of polymerization with 1 U of Klcnow DNA polymerase. After amplification, the DNA was 
concentrated bv cthanol precipitation and half of the total reaction subjected to electrophoresis on a gel 
of 4% NuSieve agarose in tris-boratc-EDTA buffer. The gel was photographed, and the DNA was 
transferred to Genatran nylon membrane and hybridized (2) to a labeled probe specific for the target 
sequence. The blot was then washed and autoradiographed. For the amplification of p-globin, the 
starring DNA was cither from the Molt-4 cell line or from the globin deletion mutant GM2064 (2). For 
lane 5,"thc starting material was 1 1 pg of the p-globin recombinant plasmid pBR328::p A (6), the molar 
equivalent of 5 u,g of genomic DNA. For lane 6, the reaction was performed as in lane 1 except that no 
enzyme was added. To increase the efficiency of amplification of the longer HLA DQa segment, 
dirriethvl sulfoxide was added to 10% and the polymerization was carried out at 37°C for 27 cycles. The 
starting DNA for the amplification of DQa was cither from the consanguineous HLA-typing cell line 
LG-2 or from the HLA class II deletion mutant LCL72 1.180 (7), (A) Ethidtum Bromide-stained get 
showing total amplified products. (Lane 1) Primers PC03 and PC04 on Molt-4 DNA; (lane 2) PC03 
and PC04 on GM2064 DNA; (lane 3) GH18 and GH19 on Molt-4; (lane 4) GH18 and GH19 6n 
GM2064; (lane 5) GH18 and GH19 on pBR328::p A DNA; (lane 6) PC03 and'PC04 on Molt-4, no 
enzyme; (lane 7) primers GH26 and GH27 on LG-2 DNA; (lane 8)' GH26 and GH27 on 
LCL72 1.180 DNA. (B) Southern blots showing specific amplified products. Lanes are numbered as in 
(A). Lanes 1 through 6 were hybridized to the la&toLRSpd oligonucleotide Erobejj2}. Lanes 7 and 8 
were hybridized to a cloned DQa cDNA probe labeled by nick-translation (11). 
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primers, since gel electrophoresis of the 
PCR products reveals a discrete band at 240 
bp absent from the HLA deletion mutant 
(7) (Fig. 2). In addition, hybridization 
screening of the Ml 3 clones from this am- 
plification indicates that about 20% are ho- 
mologous to the DQa probe, an increase of 
20 times over the p-globin amplification. 
The basis for die difference in the specificity 
of amplification, defined as the ratio of 
target to nontarget clones, is not clear. It is 
likelv to reflect the primer sequences and 
their genomic distribution rather than the 
different reaction conditions used (Fig. 2). 
The differences between die sequence of the 
P-globin primers GH18, GH19 and PC03, 
PC04 (Fig. 1) may account for the observed 
change in the gel profile of the PCR-ampIi- 
fied products (Fig. 2, lanes 1 and 2 versus 
lanes 3 and 4), reflecting some difference in 
the amplification of nontarget segments. 

Three HLA DQa PCR clones derived 
from the homozygous typing cell LG2 were 
subjected to sequence analysis (8). Two 
clones were identical to a DQa complemen- 
tary (cDNA) clone from the same cell line. 
One differed by a single nucleotide, indicat- 
ing an error rate of approximately 1/600, 
assuming the substitution occurred during 
the 27 cycles of amplification. This proce- 
dure has also been used to analyze sequences 
from polymorphic regions of the HLA 
DQP and DRp loci. 

Our rapid method for the cloning of 
specifically amplified genomic fragments re- 
lies on the ability of oligonucleotides to 
function as PCR primers with unpaired 
bases near their 5' ends. In the later cycles of 
amplification these primers anneal primarily 
to the amplified products rather than to the 
original genomic sequences, and are there- 
fore fully complementary. The p-globin 
linker-primers GH18 and GH19 appear to 
be approximately as efficient and specific as 
the fully matched primers. The HLA DQa 
linker-primers GH26 and GH27, with even 
more 5' mismatches, show an amplification 
specificity 20 times higher than the p-globin 
primers. A large number of DQa clones 
were obtained from just 100 ng of this 
amplified genomic DNA. Our data suggest 
that the error rate over many cycles of 
amplification is sufficiently low so that reli- 
able genomic sequences can be determined 
directly from PCR amplification and clon- 
ing. 

This procedure offers significant advan- 
tages over standard cloning protocols for the 
analysis of sequence polymorphisms in that 
it circumvents the construction and screen- 
ing of full genomic libraries and could po- 
tentially be initiated from nanogram 
amounts of DNA. It is capable of isolating 
only a limited region of the genome, howev- 



er, and requires sequence information to 
identify conserved primer and probe seg- 
ments. Unlike direct genomic sequencing 
(P), the cloning of amplified DNA allows 
the separation of related genes and alleles 
prior to sequencing and does not require a 
detailed knowledge of adjacent restriction 
sites. The application of oligonucleotide 
linker-primers to introduce specific restric- 
tion sites into PCR-amplified DNA may al- 
so prove useful as a general cloning strategy. 

REFERENCES AND NOTES 

1. F. Faloona and K. B. Mullis, Methods Enzymol., in 
press. 

2. R. K. Saiki et al. % Science 230, 1350 (1985). 

3. J. Messing, Methods EnzymoL 101, 20 (1983). 



IFTG, isopropyl-p-D-thiogaiaao-pyranosidc; X-gal, 
5-bromo4<hloro-3-indoM-p-D-galactosidc. 

4. L. Guo and R. Wu, ibid. 100, 60 (1983). 

5. M. Poncz, E. Schwartz, M. Ballantinc, S. Surrey,/. 
Biol. Chtm. 258, 11599 (1983). 

6. R. Saiki, N. Arnheim, H. Erlich, Biotechnology 3, 
1008 (1985). 

7. H. Erlich, J. S. Lee, J. W. Petersen, T. Bugawan, R. 
DeMars,H«m. Immunol 16, 205 (1986). 

8. T. L. Bugawan et al. y unpublished results. 

9. G. M. Church and W! Gilbert, Ptvc. Nad. Acad. Set. 
U.S.A. 81, 1991 (1984). 

10. C Auffrav et «/., Natun (London) 308, 327 (1984). 

1 1. T. Maniatis, E. F. Fritsch, J. Sambrook, in Molecular 
Cloning (Cold Spring Harbor Laboratory, Cold 
Spring Harbor, NY, 1982). 

12. We thank C. Long for the sequencing of PCR 
amplified DNA's; C. Levcnson. L. Goda, and D. 
Spasic for the synthesis of oligonucleotides; J. 
Sninsky, R. Saiki, and K. MulJis tor critical review; 
and K. Lcvenson for the preparation of this manu- 
script. 

21 April 1986; accepted 17 July 1986 



Inhibition of Endothelial Regeneration by Type-Beta 
Transforming Growth Factor from Platelets 

Ronald L. Heimark, Daniel R. Twardzik, Stephen M. Schwartz 

Damage to the vessel wall is a signal for endothelial migration and replication and for 
platelet release at the site of injury. Addition of transforming growth factor-beta 
(TGF-p) purified from platelets to growing aortic endothelial cells inhibited [ 3 H] thy- 
midine incorporation in a concentration-dependent manner. A transient inhibition of 
DNA synthesis was also observed in response to wounding; cell migration and 
replication are inhibited during the first 24 hours after wounding. By 48 hours after 
wounding both TGF-p-treated and -untreated cultures showed similar responses. 
Flow micronuorimetric analysis of cell cycle distribution indicated that after 24 hours 
of exposure to TGF-P the cells were blocked from entering S phase, and the fraction of 
cells in Gi was increased. The inhibition of the initiation of regeneration by TGF-p 
could allow time for recruitment of smooth muscle cells into the site of injury by other 
platelet components. 



THE CONTINUITY OF THE VASCUUVR 
endothelium is lost as atherosclerotic 
lesions progress (7). In contrast, 
even in large wounds the endothelium rap- 
idly regenerates both in vitro and in vivo 
(2). This raises the possibility that some 
property of the atheromatous wall or ele- 
ment of the blood interacting with the wall 
at the developing plaque can act to prevent 
the normal processes of endothelial regener- 
ation. An obvious source of such an effect at 
sites of denudation is the platelet. Among 
the components released from platelets at a 
site of vascular injury are a group of growth 
regulatory proteins, including platelet-de- 
rived growth factor (PDGF), a growth fac- 
tor similar to epidermal growth factor 
(EGF), and transforming growth factor- 
beta (TGF-p) (3, 4). 

The endothelium grows as a strictly densi- 
ty-inhibited monolayer. Although endothe- 
lial cell growth requires serum or plasma, 
PDGF and EGF are not required (5). In the 
presence of acid- treated serum, aortic endo- 



thelial cells have been shown to form colo- 
nies in soft agar (6). Similar results were 
obtained from mouse embryo cells (AKR- 
2B) or normal rat kidney (NRK) cells and 
the activity that promotes growth in soft 
agar was identified as TGF-p (4). We report 
here that TGF-P purified from human plate- 
lets can inhibit the endothelial regeneration 
process by inhibiting both replication and 
migration. 

Transforming growth factors were orig- 
inally found in viral-transformed rodent 
cells, and it was postulated that they were 
involved in neoplastic cell growth (7, 8). 
However, their presence in normal tissues, 
including kidney, placenta, and platelets, 
suggests a more general function (4, 9). 
Platelets contain 40 to 100 times as much 
TGF-p as other nonneoplastic tissues. TGF- 
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